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Speech Enhancement With Gain Limitations 
Based On Speech Activity 

Cross-Reference to Related Applications 

5 This application claims the benefit of the filing date of U.S. Provisional 

Application No. 60/1 19,279, filed February 9, 1999, and is incorporated herein by 
reference. 

Field of the Invention 

This invention relates to enhancement processing for speech coding (i.e., 
10 speech compression) systems, including low bit-rate speech coding systems 
such as MELP. 

Background of the Invention 

Low bit-rate speech coders, such as parametric speech coders, have 
improved significantly in recent years. However, low-bit rate coders still suffer 
15 from a lack of robustness in harsh acoustic environments. For example, artifacts 
introduced by low bit-rate parametric coders in medium and low signal-to-noise 
ratio (SNR) conditions can affect intelligibility of coded speech. 

Tests show that significant improvements in coded speech can be made 
when a low bit-rate speech coder is combined with a speech enhancement 

20 preprocessor. Such enhancement preprocessors typically have three main 
components: a spectral analysis/synthesis system (usually realized by a 
windowed fast Fourier transform/inverse fast Fourier transform (FFT/IFFT), a 
noise estimation process, and a spectral gain computation. The noise estimation 
process typically involves some type of voice activity detection or spectral 

25 minimum tracking technique. The computed spectral gain is applied only to the 
Fourier magnitudes of each data frame (i.e., segment) of a speech signal. An 
example of a speech enhancement preprocessor is provided in Y. Ephraim et al., 
"Speech Enhancement Using a Minimum Mean-Square Error Log-Spectral 
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Amplitude Estimator." IEEE Trans. Acoustics, Speech and Signal Processing, 
Vol. 33, pp. 443-445, April 1985, which is hereby incorporated by reference in its 
entirety. As is conventional, the spectral gain comprises individual gain values to 
be applied to the individual subbands output by the FFT process. 

5 A speech signal may be viewed as representing periods of 

articulated speech (that is, periods of "speech activity") and speech pauses. A 
pause in articulated speech results in the speech signal representing background 
- noise only, while a period of speech activity results in the speech signal 
representing both articulated speech and background noise. Enhancement 

10 preprocessors function to apply a relatively low gain during periods of speech 

pauses (since it is desirable to attenuate noise) and a higher gain during periods 
of speech (to lessen the attenuation of what has been articulated). However, 
switching from a low to a high gain value to reflect, for example, the onset of 
speech activity after a pause, and wee-versa, can result in structured "musical" 

15 (or "tonal") noise artifacts which are displeasing to the listener. In addition, 

enhancement preprocessors themselves can introduce degradations in speech 
intelligibility as can speech coders used with such preprocessors. 

To address the problem of structured musical noise, some enhancement 
preprocessors uniformly limit the gain values applied to all data frames of the 

20 speech signal. Typically, this is done by limiting an "a prion 11 signal to noise ratio 
(SNR) which is a functional input to the computation of the gain. This limitation 
on gain prevents the gain applied in certain data frames (such as data frames 
corresponding to speech pauses) from dropping too low and contributing to 
significant changes in gain between data frames (and thus, structured musical 

25 noise). However, this limitation on gain does not adequately ameliorate the 

intelligibility problem introduced by the enhancement preprocessor or the speech 
coder. 
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Summary of the invention 

The present invention overcomes the problems of the prior art to both limit 
structured musical noise and increase speech intelligibility. In the context of an 
enhancement preprocessor, an illustrative embodiment of the invention makes a 
determination of whether the speech signal to be processed represents 
articulated speech or a speech pause and forms a unique gain to be applied to 
the speech signal. The gain is unique in this context because the lowest value 
the gain may assume (i.e., its lower limit) is determined based on whether the 
speech signal is known to represent articulated speech or not. In accordance 
with this embodiment, the lower limit of the gain during periods of speech pause 
is constrained to be higher than the lower limit of the gain during periods of 
speech activity. 

In the context of this embodiment, the gain that is applied to a data frame 
of the speech signal is adaptively limited based on limited a priori SNR values. 
These a priori SNR values are limited based on (a) whether articulated speech is 
detected in the frame and (b) a long term SNR for frames representing speech. 
A voice activity detector can be used to distinguish between frames containing 
articulated speech and frames that contain speech pauses. Thus, the lower limit 
of a priori SNR values may be computed to be a first value for a frame 
representing articulated speech and a different second value, greater than the 
first value, for a frame representing a speech pause. Smoothing of the lower 
limit of the a priori SNR values is performed using a first order recursive system 
to provide smooth transitions between active speech and speech pause 
segments of the signal. 

An embodiment of the invention may also provide for reduced delay of 
coded speech data that can be caused by the enhancement preprocessor in 
combination with a speech coder. Delay of the enhancement preprocessor and 
coder can be reduced by having the coder operate, at least partially, on 
incomplete data samples to extract at least some coder parameters. The total 
delay imposed by the preprocessor and coder is usually equal to the sum of the 
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delay of the coder and the length of overlapping portions of frames in the 
enhancement preprocessor. However, the invention takes advantage of the fact 
that some coders store "look-ahead" data samples in an input buffer and use 
these samples to extract coder parameters. The look-ahead samples typically 
5 have less influence on the quality of coded speech than other samples in the 
input buffer. Thus, in some cases, the coder does not need to wait for a fully 
processed, /.e. t complete, data frame from the preprocessor, but instead can 
extract coder parameters from incomplete data samples in the input buffer. By 
operating on incomplete data samples, delay of the enhancement preprocessor 
10 and coder can be reduced without significantly affecting the quality of the coded 
data. 

For example, delay in a speech preprocessor and speech coder 
combination can be reduced by multiplying an input frame by an analysis window 
and enhancing the frame in the enhancement preprocessor. After the frame is 

15 enhanced, the left half of the frame is multiplied by a synthesis window and the 
right half is multiplied by an inverse analysis window. The synthesis window can 
be different from the analysis window, but preferably is the same as the analysis 
window. The frame is then added to the speech coder input buffer, and coder 
parameters are extracted using the frame. After coder parameters are extracted, 

20 the right half of the frame in the speech coder input buffer is multiplied by the 
analysis and the synthesis window, and the frame is shifted in the input buffer 
before the next frame is input. The analysis windows, and synthesis window 
used to process the frame in the coder input buffer can be the same as the 
analysis and synthesis windows used in the enhancement preprocessor, or can 

25 be slightly different, e.g., the square root of the analysis window used in the 

preprocessor. Thus, the delay imposed by the preprocessor can be reduced to a 
very small level, e.g., 1-2 milliseconds. 

These and other aspects of the invention will be appreciated and/or 
obvious in view of the following description of the invention. 

30 
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Brief Description of the Drawing s 

The invention is described in connection with the following drawings 
where reference numerals indicate like elements and wherein: 

Figure 1 is a schematic block diagram of an illustrative embodiment of the 
invention. 

Figure 2 is a flowchart of steps for a method of processing speech and 
other signals in accordance with the embodiment of Figure 1 . 

Figure 3 is a flowchart of steps for a method for enhancing speech signals 
in accordance with the embodiment of Figure 1 . 

Figure 4 is a flowchart of steps for a method of adaptively adjusting an a 
priori SNR value in accordance with the embodiment of Figure 1 . 

Figure 5 is a flowchart of the steps for a method of applying a limit to the a 
priori signal to noise ratio for use in a gain computation. 



Detailed Description 

A. Introduction to Illustrative Embodiments 

As is conventional in the speech coding art, the illustrative embodiment of 
the present invention is presented as comprising individual functional blocks (or 
"modules"). The functions these blocks represent may be provided through the 
use of either shared or dedicated hardware, including, but not limited to, 
hardware capable of executing software. For example, the functions of blocks 1- 
5 presented in Figure 1 may be provided by a single shared processor. (Use of 
the term "processor" should not be construed to refer exclusively to hardware 
capable of executing software.) 
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Illustrative embodiments may be realized with digital signal processor 
(DSP) or general purpose personal computer (PC) hardware, available from any 
of a number of manufacturers, read-only memory (ROM) for storing software 
performing the operations discussed below, and random access memory (RAM) 
5 for storing DSP/PC results. Very large scale integration (VLSI) hardware 

embodiments, as well as custom VLSI circuitry in combination with a general 
purpose DSP/PC circuit, may also be provided. 

Illustrative software for performing the functions presented in Figure 1 is 
provided in the Software Appendix hereto. 

10 B. The Illustrative Embodiment 

Figure 1 presents a schematic block diagram of an illustrative 
embodiment 8 of the invention. As shown in Figure 1, the illustrative 
embodiment processes various signals representing speech information. These 
signals include a speech signal (which includes a pure speech component, s(k), 

15 and a background noise component, n(k)), data frames thereof, spectral 

magnitudes, spectral phases, and coded speech. In this example, the speech 
signal is enhanced by a speech enhancement preprocessor 8 and then coded by 
a coder 7. The coder 7 in this illustrative embodiment is a 2400 bps MIL 
Standard MELP coder, such as that described in A. McCree et al., M A 2.4 KBIT/S 

20 MELP Coder Candidate for the New U.S. Federal Standard," Proa, IEEE Intl. 

Conf. Acoustics, Speech, Signal Processing (ICASSP), pp. 200-203, 1996, which 
is hereby incorporated by reference in its entirety. Figures 2, 3, 4, and 5 present 
flow diagrams of the processes carried out by the modules presented in Figure 1 . 

1 . The Segmentation Module 

25 The speech signal, s(k) + n(k), is input into a segmentation module 1 . 

The segmentation module 1 segments the speech signal into frames of 256 
samples of speech and noise data (see step 100 of Figure 2; the size of the data 
frame can be any desired size, such as the illustrative 256 samples), and applies 
an analysis window to the frames prior to transforming the frames into the 
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frequency domain (see step 200 of Figure 2). As is well known, applying the 
analysis window to the frame affects the spectral representation of the speech 
signal. 

The analysis window is tapered at both ends to reduce cross talk between 
subbands in the frame. Providing a long taper for the analysis window 
significantly reduces cross talk, but can result in increased delay of the 
preprocessor and coder combination 10. The delay inherent in the 
preprocessing and coding operations can be minimized when the frame advance 
(or a multiple thereof) of the enhancement preprocessor 8 matches the frame 
advance of the coder 7. However, as the shift between later synthesized frames 
in the enhancement preprocessor 8 increases from the typical half-overlap (e.g., 
128 samples) to the typical frame shift of the coder 7 (e.g., 180 samples), 
transitions between adjacent frames of the enhanced speech signal s(k) become 
less smooth. These discontinuities arise because the analysis window 
attenuates the input signal most at the edges of each frame and the estimation 
errors within each frame tend to spread out evenly over the entire frame. This 
leads to larger relative errors at the frame boundaries, and the resulting 
discontinuities, which are most notable for low SNR conditions, can lead to pitch 
estimation errors, for example. 

Discontinuities may be greatly reduced if both an analysis and synthesis 
windows are used in the enhancement preprocessor 8. For example, the square 
root of the Tukey window 



gives good performance when used as both an analysis and a synthesis window. 
M is the frame size in samples and M Q is the length of overlapping sections of 
adjacent synthesis frames. 



v '0.5(l-cos(/ri/M 0 )) 



for I < i < M 0 

for M - M 0 < i < M 



w(i) = v 0.5(1 - cos(;r(M - i) / M 0 )) 



otherwise 



(1) 
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Windowed frames of speech data are next enhanced. This enhancement 
step is referenced generally as step 300 of Figure 2 and more particularly as the 
sequence of steps in Figures 3, 4, and 5. 

2. The Transform Module 

The windowed frames of the speech signal are output to a transform 
module 2, which applies a conventional fast Fourier transform (FFT) to the frame 
(see step 310 of Figure 3). Spectral magnitudes output by the transform module 
2 are used by a noise estimation module 3 to estimate the level of noise in the 
frame. 

3. The Noise Estimation Module 

The noise estimation module 3 receives as input the spectral magnitudes 
output by the transform module 2 and generates a noise estimate for output to 
the gain function module 4 (see step 320 of Figure 3). The noise estimate 
includes conventionally computed a priori and a posteriori SNRs. The noise 
estimation module 3 can be realized with any conventional noise estimation 
technique, and may be realized in accordance with the noise estimation 
technique presented in the above-referenced U.S. Provisional Application No. 
60/1 1 9,279, filed February 9, 1 999. 

4. The Gain Function Module 

To prevent musical distortions and avoid distorting the overall spectral 
shape of speech sounds (and thus avoid disturbing the estimation of spectral 
parameters), the lower limit of the gain, G, must be set to a first value for frames 
which represent background noise only (a speech pause) and to a second lower 
value for frames which represent active speech. These limits and the gain are 
determined illustratively as follows. 

4.1 Limiting the a priori SNR 
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The gain function, G, determined by module 4 is a function of an a priori 
SNR value c k and an a posteriori SNR value y k (referenced above). The a priori 
SNR value c k is adaptively limited by the gain function module 4 based on 
whether the current frame contains speech and noise or noise only, and based 
5 on an estimated long term SNR for the speech data. If the current frame 

contains noise only (see step 331 of Figure 4), a preliminary lower limit 4mini(>-) = 
0.12 is preferably set for the a priori SNR value q k (see step 332 of Figure 4). If 
the current frame contains speech and noise (i.e., active speech), the preliminary 
lower limit 4mini(M is set to 

10 ^nim (A)=0.12exp(-5)(0.5 + SNR LT (/0) 065 (3) 

where SNR L t is the long term SNR for the speech data, and k is the frame index 
for the current frame (see step 333 of Figure 4). However, 4 mi m is limited to be 
no greater than 0.25 (see steps 334 and 335 of Figure 4). The long term SNR L t 
is determined by generating the ratio of the average power of the speech signal 
15 to the average power of the noise over multiple frames and subtracting 1 from 
the generated ratio. Preferably, the speech signal and the noise are averaged 
over a number of frames that represent 1-2 seconds of the signal. If the SNR LT 
is less than 0, the SNR U t is set equal to 0. 

The actual lower limit for the a priori SNR is determined by a first order 
20 recursive filter: 

Smm(X) = 0.9^niin(X-1) + 0.1* mln i(>0 (4) 

This filter provides for a smooth transition between the preliminary values for 
speech frames and noise only frames (see step 336 of Figure 4). The smoothed 
lower limit c m j n (/-) is then used as the lower limit for the a priori SNR value l k (k) in 
25 the gain computation discussed below. 

4.2 Determining the Gain with a Limited a priori SNR 

9 
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As is known in the art, gain, G, used in speech enhancement 
preprocessors is a function of the a priori signal to noise ratio, 4. an d the a 
posteriori SNR value, y. That is, G k = fUk('0<7k(A.))« where /. is the frame index 
and k is the subband index. In accordance with an embodiment of this invention, 
the lower limit of the a priori SNR, 4min(>0. is applied to the a priori SNR (which is 
determined by noise estimation module 3) the as follows: 

Ck(X) = ZkM if > Cmin(A) 

Ck(X) = Cmin(>w) if 4k(>-) ^ Cmin(>.) 

(see steps 510 and 520 of Figure 5). 

Based on the a posteriori SNR estimation generated by the noise 
estimation module 3 and the limited a priori SNR discussed above, the gain 
function module 4 determines a gain function, G (see step 530 Figure 5). A 
suitable gain function for use in realizing this embodiment is a conventional 
Minimum Mean Square Error Log Spectral Amplitude estimator (MMSE LSA), 
such as the one described in Y. Ephraim et al., "Speech Enhancement Using a 
Minimum Mean-Square Error Log-Spectral Amplitude Estimator," IEEE Trans. 
Acoustics, Speech and Signal Processing, Vol. 33, pp. 443-445, April 1985, 
which is hereby incorporated by reference as if set forth fully herein. Further 
improvement can be obtained by using a multiplicatively modified MMSE LSA 
estimator, such as that described in D. Malah, et al., "Tracking Speech Presence 
Uncertainty to Improve Speech Enhancement in Non-Stationary Noise 
Environments," Proc. ICASSP, 1999, to account for the probability of speech 
presence. This reference is incorporated by reference as if set forth fully herein. 

5. Applying the Gain Function 

The gain, G, is applied to the noisy spectral magnitudes of the data frame 
output by the transform module 2. This is done in conventional fashion by 
multiplying the noisy spectral magnitudes by the gain, as shown in Figure 1 (see 
step 340 of Figure 3). 
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6. The Inverse Transform Module 

A conventional inverse FFT is applied to the enhanced spectral 
amplitudes by the inverse transform module 5, which outputs a frame of 
enhanced speech to an overlap/add module 6 (see step 350 of Figure 3). 

7. Overlap Add Module; Delay Reduction 

The overlap/add module 6 synthesizes the output of the inverse transform 
module 5 and outputs the enhanced speech signal s(k) to the coder 7. 
Preferably, the overlap/add module 6 reduces the delay imposed by the 
enhancement preprocessor 8 by multiplying the left "hair (e.g., the less current 
180 samples) in the frame by a synthesis window and the right half (e.g., the 
more current 76 samples) in the frame by an inverse analysis window (see step 
400 of Figure 2). The synthesis window can be different from the analysis 
window, but preferably is the same as the analysis window (in addition, these 
windows are preferably the same as the analysis window referenced in step 200 
of Figure 2). The sample sizes of the left and right "halves" of the frame will vary 
based on the amount of data shift that occurs in the coder 7 input buffer as 
discussed below (see the discussion relating to step 800, below). In this case, 
the data in the coder 7 input buffer is shifted by 180 samples. Thus, the left half 
of the frame includes 180 samples. Since the analysis/synthesis windows have 
a high attenuation at the frame edges, multiplying the frame by the inverse 
analysis filter will greatly amplify estimation errors at the frame boundaries. 
Thus, a small delay of 2-3 ms is preferably provided so that the inverse analysis 
filter is not multiplied by the last 1 6-24 samples of the frame. 

Once the frame is adjusted by the synthesis and inverse analysis 
windows, the frame is then provided to the input buffer (not shown) of the coder 
7 (see step 500 of Figure 2). The left portion of the current frame is overlapped 
with the right half of the previous frame that is already loaded into the input 
buffer. The right portion of the current frame, however, is not overlapped with 
any frame or portion of a frame in the input buffer. The coder 7 then uses the 
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data in the input buffer, including the newly input frame and the incomplete right 
half data, to extract coding parameters (see step 600 of Figure 2). For example, 
a conventional MELP coder extracts 10 linear prediction coefficients, 2 gain 
factors, 1 pitch value, 5 bandpass voicing strength values, 10 Fourier 
magnitudes, and an aperiodic flag from data in its input buffer. However, any 
desired information can be extracted from the frame. Since the MELP coder 7 
does not use the latest 60 samples in the input buffer for the Linear Predictive 
Coefficient (LPC) analysis or computation of the first gain factor, any 
enhancement errors in these samples have a low impact on the overall 
performance of the coder 7. 

After the coder 7 extracts coding parameters, the right half of the last 
input frame (e.g., the more current 76 samples) are multiplied by the analysis 
and synthesis windows (see step 700 of Figure 2). These analysis and synthesis 
windows are preferably the same as those referenced in step 200, above 
(however, they could be different, such as the square-root of the analysis window 
of step 200). 

Next, the data in the input buffer is shifted in preparation for input of the 
next frame, e.g., the data is shifted by 180 samples (see step 800 of Figure 2). 
As discussed above, the analysis and synthesis windows can be the same as 
the analysis window used in the enhancement preprocessor 8, or can be 
different from the analysis window, e.g., the square root of the analysis window. 
By shifting the final part of overlap/add operations into the coder 7 input buffer, 
the delay of the enhancement preprocessor 8/coder 7 combination can be 
reduced to 2-3 milliseconds without sacrificing spectral resolution or cross talk 
reduction in the enhancement preprocessors. 

C. Discussion 

While the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alternatives, modifications and 
variations will be apparent to those skilled in the art. Accordingly, the preferred 
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embodiments of the invention as set forth herein are intended to be illustrative, 
not limiting. Various changes may be made without departing from the spirit and 
scope of the invention. 

For example, while the illustrative embodiment of the present invention is 
presented as operating in conjunction with a conventional MELP speech coder, 
other speech coders can be used in conjunction with the invention; 

The illustrative embodiment of the present invention employs an FFT and 
IFFT, however, other transforms may be used in realizing the present invention, 
such as a discrete Fourier transform (DFT) and inverse DFT. 

While the noise estimation technique in the referenced provisional patent 
application is suitable for the noise estimation module 3, other algorithms may 
also be used such as those based on voice activity detection or a spectral 
minimum tracking approach, such as described in D. Malah et al., 'Tracking 
Speech Presence Uncertainty to Improve Speech Enhancement in Non- 
Stationary Noise Environments," Proc. IEEE Intl. Conf. Acoustics, Speech, 
Signal Processing (ICASSP). 1999; or R. Martin, "Spectral Subtraction Based on 
Minimum Statistics, H Proc. European Signal Processing Conference, vol. 1, 
1994, which are hereby incorporated by reference in their entirety. 

Although the preliminary lower limit 4m in i(/.) = 0.12 is preferably set for the 
a priori SNR value c k when a frame represents a speech pause (background 
noise only), this preliminary lower limit ^ min i could be set to other values as well. 

The process of limiting the a priori SNR is but one possible mechanism for 

limiting the gain values applied to the noisy spectral magnitudes. However, other 

methods of limiting the gain values could be employed. It is advantageous that 

the lower limit of the gain values for frames representing speech activity be less 

than the lower limit of the gain values for frames representing background noise 

only. However, this advantage could be achieved other ways, such as, for 

example, the direct limitation of gain values (rather than the limitation of a 

functional antecedent of the gain, like a priori SNR). 
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Although frames output from the inverse transform module 5 of the 
enhancement preprocessor 8 are preferably processed as described above to 
reduce the delay imposed by the enhancement preprocessor 8, this delay 
reduction processing is not required to accomplish enhancement. Thus, the 
enhancement preprocessor 8 could operate to enhance the speech signal 
through gain limitation as illustratively discussed above (for example, by 
adaptively limiting the a priori SNR value c k ). Likewise, delay reduction as 
illustratively discussed above does not require use of the gain limitation process. 

Delay in other types of data processing operations can be reduced by 
applying a first process on a first portion of a data frame, i.e., any group of data, 
and applying a second process to a second portion of the data frame. The first 
and second processes could involve any desired processing, including 
enhancement processing. Next, the frame is combined with other data so that 
the first portion of the frame is combined with other data. Information, such as 
coding parameters, are extracted from the frame including the combined data. 
After the information is extracted, a third process is applied to the second portion 
of the frame in preparation for combination with data in another frame. 
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WHAT IS CLAIMED IS : 

1 . A method for enhancing a speech signal for use in speech coding, 
the speech signal representing background noise and periods of articulated 
speech, the speech signal being divided into a plurality of data frames, the 
method comprising the steps of: 

applying a transform to the speech signal of a data frame to generate a 
plurality of sub-band speech signals; 

making a determination whether the speech signal corresponding to the 
data frame represents articulated speech; 

applying individual gain values to individual sub-band speech signals, 
wherein the lowest permissible gain value for a frame determined to represent 
articulated speech is lower than the lowest permissible gain value for a frame 
determined to represent background noise only; and 

applying an inverse transform to the plurality of sub-band speech signals. 

2. The method of claim 1 further comprising the step of determining 
the individual gain values and wherein the lowest permissible gain value is a 
function of a lowest permissible a priori signal to noise ratio. 

3. A method for enhancing a signal for use in speech coding, the 
signal being divided into data frames and representing background noise 
information and periods of articulated speech information, the method 
comprising the steps of: 

making a determination whether the signal of a data frame represents 
articulated speech information; and 
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applying a gain value to the signal, wherein the lowest permissible gain 
value for a frame determined to represent articulated speech is lower than the 
lowest permissible gain value for a frame determined to represent background 
noise only. 

4. The method of claim 3 further comprising the step of determining 
the gain value and wherein the lowest permissible gain value is a function of a 
lowest permissible a priori signal to noise ratio. 
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